ABSTRACT


A  considerable  volume  of  recent  research  work  has  been  devoted 
to  mixtures  of  discrete  distributions.  The  purpose  of  this  thesis  is  to 
obtain  estimates  of  the  parameters  of  the  mixed-Poisson  distribution  and 
also  to  study  their  mathematical  properties. 

In Chapter I, some recent work in these fields is reviewed,
namely the work of Gumbel [15], Rider [26] and Blischke [4]. Chapter II
contains a detailed discussion of the derivation of the mixed-Poisson
distribution. In Chapter III, moment estimators for a mixture of two
Poisson distributions are obtained and their asymptotic properties are
discussed. Chapter IV contains a variety of results which are obtained
as a by-product of our approach.
CHAPTER I


Introduction

Mixtures  of  distributions  have  received  an  increasing  amount 
of  attention  in  the  recent  statistical  literature,  partly  because  of 
interest  in  the  mathematical  aspects  of  mixtures  and  partly  because  of 
the  considerable  number  of  applied  problems  in  which  mixtures  are 
encountered. 

The dissection of a mixed frequency distribution is often very
complicated [16]. This is certainly true for a mixture of two normal
distributions, which was studied by Karl Pearson [22] in perhaps the
earliest investigation of the dissection problem. Pearson was led to an
equation of the ninth degree, the setting up and solution of which involved
a tremendous amount of calculation. This calculation could doubtless be
performed rather easily today by a high-speed computer, but when his paper
was published in 1894 it was extremely laborious.

Mixtures of distributions present two types of problems. The
first is the problem of identifiability, that is, given that a distribution
function $F$ is a probability mixture of distribution functions belonging
to some family $\mathcal{F}$, is the mixture unique?

This topic has been dealt with quite extensively in recent
papers by Robbins [27], Teicher [30] and others. The second problem is
that of estimating the parameters of the individual distribution functions
comprising the mixture (and the mixing measure). This is clearly possible
only if the given mixture is identifiable (Pearson [22]; C. R. Rao [23]
and [24]).

Gumbel [15] solved the dissection problem for frequency
distributions which are the sums of two asymmetrical exponential
distributions, or of mixed Poisson distributions, but in both cases
he restricted himself to the case in which the proportion contributed
to the sum by each component is known.

Schilling [29] discussed extensively the chi-square test of the
hypothesis that an observed distribution corresponds to

(1) a Poisson distribution,

(2) a weighted average of mixed Poisson distributions.

Applications are made to the examples given by F. Thorndike [33].

In 1961, Gumbel's theory was extended by Rider [25] to mixed
exponential distributions and mixed-Poisson distributions.
CHAPTER II


2.1 Introduction

In this chapter, we will discuss the method of estimating the
parameters of mixed distributions by using sample moments. This method
has been outlined by Gumbel [15], who discussed the general dissection
problem and showed how the method of moments can be used to estimate
the parameters of a mixed distribution. He applied the method to mixed
exponential and mixed-Poisson distributions, but assumed that the
proportion $p$ is known. Gumbel's theory was extended by Rider [25] to
a mixture of two exponential distributions. The same method is applied
here to a mixture of two Poisson distributions.

Let us consider a population that is a mixture of two Poisson
populations. An example would be a batch of manufactured articles
mixed from two lots with different fractions defective. Another example
would be the counts of bacterial colonies on the squares of a hemocytometer
where different types of bacteria have been mixed.

To be definite, let $X_1, X_2, \ldots, X_n$ be independent and
identically distributed random variables, each having the distribution

\[ P(X = x) = p\,\frac{e^{-\lambda}\lambda^{x}}{x!} + (1-p)\,\frac{e^{-\mu}\mu^{x}}{x!} \qquad (x = 0, 1, 2, \ldots), \tag{2.1.1} \]

where

\[ 0 < p < 1; \qquad \lambda, \mu > 0 . \]

We shall refer to the distribution (2.1.1) as the mixed-Poisson distribution.
$\lambda$ and $\mu$ are the parameters of the two Poisson distributions, which have
been mixed in the unknown proportions $p$ and $(1-p)$.


2.2 Mathematical Properties

When investigating random variables which take on only the
integers $x = 0, 1, 2, \ldots$ it is simpler to deal with probability
generating functions than with characteristic functions.

Let $X$ be a random variable and let

\[ p_x = P(X = x) \qquad (x = 0, 1, 2, \ldots), \]

then

\[ \sum_{x=0}^{\infty} p_x = 1 . \]

The probability generating function (p.g.f.) of $X$ is defined by

\[ \psi(s) = \sum_{x=0}^{\infty} p_x s^{x} \qquad (-1 \le s \le 1); \]

we notice that

\[ \psi(1) = \sum_{x=0}^{\infty} p_x = 1 . \]

Hence the series on the right-hand side is absolutely and uniformly
convergent in the interval $|s| \le 1$. Thus the generating function is
continuous. It determines the probability function uniquely, since
$\psi(s)$ can be represented in a unique way as a power series of the form

\[ \psi(s) = \sum_{x=0}^{\infty} p_x s^{x} \qquad (-1 \le s \le 1) . \]
For the mixed-Poisson distribution,

\[ P(X = x) = p\,\frac{e^{-\lambda}\lambda^{x}}{x!} + (1-p)\,\frac{e^{-\mu}\mu^{x}}{x!} \qquad (x = 0, 1, 2, \ldots) . \]

The probability generating function (p.g.f.) of the mixed-Poisson
distribution is given by

\[ \psi(s) = p\,e^{-\lambda(1-s)} + (1-p)\,e^{-\mu(1-s)} . \]
The moments of the random variable $X$ can be determined from the derivatives
of the generating function at the point $s = 1$. Let us, for example,
determine the moments of the first and second order. We have

\[ \psi'(s) = \sum_{x} x\,p_x s^{x-1}, \qquad \psi''(s) = \sum_{x} x(x-1)\,p_x s^{x-2} . \]

Hence

\[ \psi'(1) = \sum_{x} x\,p_x = E(X), \]

\[ \psi''(1) = \sum_{x} x(x-1)\,p_x = E(X^2) - E(X); \]

we then obtain

\[ E(X^2) = \psi''(1) + \psi'(1) . \]


2.3 The Moment Generating Function of the Mixed-Poisson Distribution

Since

\[ e^{tx} = \sum_{i=0}^{\infty} \frac{(tx)^{i}}{i!}, \]

we have

\[ E(e^{tX}) = \sum_{i=0}^{\infty} \mu_i'\,\frac{t^{i}}{i!}, \]

where $\mu_i' = E(X^{i})$ is the $i$-th moment of $X$ about the origin.
The function

\[ m(t) = E(e^{tX}) \]

is called the moment generating function (m.g.f.) of $X$. If $m(t)$ is
differentiated $k$ times with respect to $t$ and then evaluated at $t = 0$,
we note that

\[ \left.\frac{d^{k} m(t)}{dt^{k}}\right|_{t=0} = \mu_k' . \]

To obtain the moment generating function of $X - \mu$, where in this general
discussion $\mu$ denotes the mean $E(X)$, we consider

\[ e^{t(x-\mu)} = 1 + \frac{t(x-\mu)}{1!} + \frac{t^{2}(x-\mu)^{2}}{2!} + \cdots ; \]

then

\[ E\{e^{t(X-\mu)}\} = 1 + \sum_{i=1}^{\infty} \mu_i\,\frac{t^{i}}{i!}, \]

where $\mu_i$ is the $i$-th moment of $X$ about the mean. If we set

\[ M(t) = E\{e^{t(X-\mu)}\}, \]

then

\[ \left.\frac{d^{k} M(t)}{dt^{k}}\right|_{t=0} = \mu_k . \]

The moment generating function of any function $\varphi(x)$ may be defined as

\[ M(t) = E\{e^{t\varphi(X)}\}, \]

since it may easily be seen that

\[ \left.\frac{d^{k} M(t)}{dt^{k}}\right|_{t=0} = \mu_k', \]

where $\mu_k'$ is the $k$-th moment of $\varphi(x)$ about the origin.


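In the case of the mixed-Poisson distribution, we have

\[ m(t) = E\{e^{tX}\} = \sum_{x=0}^{\infty} e^{tx} p_x . \]

Thus

\[ m(t) = \sum_{x=0}^{\infty} e^{tx}\left[ p\,\frac{e^{-\lambda}\lambda^{x}}{x!} + (1-p)\,\frac{e^{-\mu}\mu^{x}}{x!} \right] = p\,e^{-\lambda(1-e^{t})} + (1-p)\,e^{-\mu(1-e^{t})}, \]

that is,

\[ m(t) = p\,e^{-\lambda(1-e^{t})} + (1-p)\,e^{-\mu(1-e^{t})}, \tag{2.3.1} \]

and putting $1 - e^{t} = \theta$ in (2.3.1), we get

\[ m(t) = p\,e^{-\lambda\theta} + (1-p)\,e^{-\mu\theta} . \]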
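The closed form (2.3.1) can be checked numerically against the defining series $\sum_x e^{tx} P(X = x)$. The following is a minimal sketch, not part of the original derivation; the function names and the truncation point `terms` are illustrative assumptions.

```python
import math


def mixed_poisson_pmf(x, p, lam, mu):
    """P(X = x) for a mixture of Poisson(lam) and Poisson(mu), weights p and 1 - p."""
    first = math.exp(-lam) * lam**x / math.factorial(x)
    second = math.exp(-mu) * mu**x / math.factorial(x)
    return p * first + (1 - p) * second


def mgf_closed_form(t, p, lam, mu):
    """Closed form (2.3.1), written with theta = 1 - e^t."""
    theta = 1.0 - math.exp(t)
    return p * math.exp(-lam * theta) + (1 - p) * math.exp(-mu * theta)


def mgf_by_summation(t, p, lam, mu, terms=100):
    """Truncated series sum_x e^{tx} P(X = x); 'terms' is an assumed cutoff."""
    return sum(math.exp(t * x) * mixed_poisson_pmf(x, p, lam, mu)
               for x in range(terms))
```

For moderate values of $t$, $\lambda$ and $\mu$ the two functions should agree to within rounding error, and both equal 1 at $t = 0$.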


We may obtain the following relation between $M(t)$ and $m(t)$:

\[ M(t) = e^{-t\mu}\, m(t), \]

where $\mu$ here denotes the mean $E(X)$.




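2.4 Mean and Variance of the Mixed-Poisson Distribution

The first moment about the origin is

\[ m_1 = E(X) = p\lambda + (1-p)\mu, \]

that is,

\[ \text{mean} = p\lambda + (1-p)\mu . \]

The second moment about the origin is

\[ m_2 = E(X^{2}) = p\lambda^{2} + (1-p)\mu^{2} + p\lambda + (1-p)\mu . \]

Hence,

\[ \operatorname{var}(X) = p\lambda^{2} + (1-p)\mu^{2} + p\lambda + (1-p)\mu - [p\lambda + (1-p)\mu]^{2} \]

\[ \phantom{\operatorname{var}(X)} = p(1-p)\left[ (\lambda-\mu)^{2} + \frac{\lambda}{1-p} + \frac{\mu}{p} \right]; \]

similarly

\[ m_3 = E(X^{3}) = \sum_{x=0}^{\infty} x^{3} p_x = \sum_{x=0}^{\infty} [x(x-1)(x-2) + 3x(x-1) + x]\, p_x \]

\[ \phantom{m_3} = p\lambda^{3} + (1-p)\mu^{3} + 3p\lambda^{2} + 3(1-p)\mu^{2} + p\lambda + (1-p)\mu . \]

Hence,

\[ m_3 = p(\lambda + 3\lambda^{2} + \lambda^{3}) + (1-p)(\mu + 3\mu^{2} + \mu^{3}) . \]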
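As a sanity check on the moment formulas of this section, the first three moments about the origin and the variance can be compared with direct summation over the probability function. This is only a sketch under the assumption that the mixture is parameterized by $(p, \lambda, \mu)$ as in (2.1.1); the helper names and the cutoff `terms` are hypothetical.

```python
import math


def poisson_pmf(x, rate):
    """Poisson probability e^{-rate} rate^x / x!."""
    return math.exp(-rate) * rate**x / math.factorial(x)


def mixture_raw_moments(p, lam, mu):
    """First three moments about the origin of the mixed-Poisson distribution."""
    q = 1 - p
    m1 = p * lam + q * mu
    m2 = p * (lam + lam**2) + q * (mu + mu**2)
    m3 = p * (lam + 3 * lam**2 + lam**3) + q * (mu + 3 * mu**2 + mu**3)
    return m1, m2, m3


def mixture_variance(p, lam, mu):
    """Variance written as p*lam + q*mu + p*q*(lam - mu)^2."""
    q = 1 - p
    return p * lam + q * mu + p * q * (lam - mu) ** 2


def raw_moments_by_summation(p, lam, mu, terms=100):
    """Same moments by truncated summation; 'terms' is an assumed cutoff."""
    sums = [0.0, 0.0, 0.0]
    for x in range(terms):
        prob = p * poisson_pmf(x, lam) + (1 - p) * poisson_pmf(x, mu)
        for k in range(3):
            sums[k] += x ** (k + 1) * prob
    return tuple(sums)
```

The variance returned by `mixture_variance` is the same quantity as $m_2 - m_1^2$, only rearranged.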


In the mixed-Poisson distribution, we can easily see that the
two requirements for a function of a random variable to be a probability
density function are satisfied by noting that

(1) $p(x; \lambda, \mu, p) \ge 0$ for $x = 0, 1, 2, \ldots$,

(2) $\sum_{x=0}^{\infty} p(x; \lambda, \mu, p) = 1$,

where $p(x; \lambda, \mu, p)$ is the probability density function of the mixed-Poisson
distribution. The cumulative distribution function of $X$ is given by

\[ F(x; \lambda, \mu, p) = \sum_{y=0}^{x} p(y; \lambda, \mu, p) \qquad \text{if } x = 0, 1, 2, \ldots \]
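The two requirements and the cumulative distribution function can be illustrated with a short sketch. The function names below are assumptions introduced for illustration, and the infinite sum is truncated at a finite number of terms.

```python
import math


def mixed_poisson_pmf(x, lam, mu, p):
    """p(x; lam, mu, p) of the mixed-Poisson distribution for x = 0, 1, 2, ..."""
    first = math.exp(-lam) * lam**x / math.factorial(x)
    second = math.exp(-mu) * mu**x / math.factorial(x)
    return p * first + (1 - p) * second


def mixed_poisson_cdf(x, lam, mu, p):
    """F(x; lam, mu, p): partial sum of the probability function over y = 0..x."""
    return sum(mixed_poisson_pmf(y, lam, mu, p) for y in range(x + 1))
```

Evaluating `mixed_poisson_cdf` at a large argument gives a value close to 1, which is requirement (2) up to truncation.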


2.5 Method of Moments

A method of estimating the parameters of mixed distributions by
using sample moments has been outlined by Gumbel [15], who considered mixed
exponential and mixed-Poisson distributions. The method is applied here
for estimating the parameters of the mixed-Poisson distribution.

Consider the frequency distribution of the mixed-Poisson
distribution

\[ f(x) = p\,\frac{e^{-\lambda}\lambda^{x}}{x!} + (1-p)\,\frac{e^{-\mu}\mu^{x}}{x!} . \tag{2.5.1} \]

Let $m_1'$, $m_2'$ and $m_3'$ be the first three moments about the origin of a
sample from the mixed-Poisson distribution, and let $p^*$, $\lambda^*$ and $\mu^*$
denote the estimators of $p$, $\lambda$ and $\mu$ respectively, obtained by the
method of moments.

Then
\[ p^*\lambda^* + (1-p^*)\mu^* = m_1' \tag{2.5.2} \]

\[ p^*(\lambda^* + \lambda^{*2}) + (1-p^*)(\mu^* + \mu^{*2}) = m_2' \tag{2.5.3} \]

\[ p^*(\lambda^* + 3\lambda^{*2} + \lambda^{*3}) + (1-p^*)(\mu^* + 3\mu^{*2} + \mu^{*3}) = m_3' \tag{2.5.4} \]

From (2.5.2)

\[ p^*(\lambda^* - \mu^*) = m_1' - \mu^* \tag{2.5.5} \]

From (2.5.3)

\[ p^*(\lambda^* - \mu^*)(1 + \lambda^* + \mu^*) = m_2' - \mu^* - \mu^{*2} \tag{2.5.6} \]

From (2.5.4)

\[ p^*(\lambda^* - \mu^*)\,[\,1 + 3(\lambda^* + \mu^*) + \lambda^{*2} + \lambda^*\mu^* + \mu^{*2}\,] = m_3' - \mu^* - 3\mu^{*2} - \mu^{*3} \tag{2.5.7} \]

Substituting the expression for $p^*(\lambda^* - \mu^*)$ from (2.5.5) into
(2.5.6) and (2.5.7) leads to the following equations.
From equation (2.5.6)

\[ m_1' + m_1'(\lambda^* + \mu^*) - \lambda^*\mu^* = m_2', \]

that is,

\[ m_1'[\,1 + (\lambda^* + \mu^*)\,] - \lambda^*\mu^* = m_1'[\,1 + S^*\,] - P^* = m_2' . \tag{2.5.8} \]

From equation (2.5.7)

\[ m_1'[\,1 + 3S^* + S^{*2} - P^*\,] - 3P^* - S^*P^* = m_3', \tag{2.5.9} \]

where

\[ S^* = \lambda^* + \mu^*, \qquad P^* = \lambda^*\mu^* . \]

The solutions of (2.5.8) and (2.5.9) are readily found to be
\[ S^* = \frac{m_1'^{2} + 2m_1' - m_1'm_2' - 3m_2' + m_3'}{m_2' - m_1' - m_1'^{2}} \tag{2.5.10} \]

and

\[ P^* = \frac{m_1'^{2} - m_1'm_2' + m_1'm_3' - m_2'^{2}}{m_2' - m_1' - m_1'^{2}} . \tag{2.5.11} \]

The estimates $\lambda^*$ and $\mu^*$ are

\[ \tfrac{1}{2} S^* \pm \tfrac{1}{2}\,[\,S^{*2} - 4P^*\,]^{1/2} . \tag{2.5.12} \]

Finally, the estimator $p^*$ can be found from (2.5.5).

Remarks (see Rider [25]):

(2.5.5), will refer to the component having $\lambda^*$ as a parameter, and
$(1-p^*)$ will refer to the other component.

(2) From continuity considerations, it is seen that imaginary roots may
occur with positive probability. However, if $\lambda \ne \mu$, the proposed
estimators are consistent, and the probability that $\lambda^* > 0$, $\mu^* > 0$,
$0 < p^* < 1$ approaches 1 as $n \to \infty$. This follows from the fact that,
in this case, the estimators, regarded as functions of $(m_1', m_2', m_3')$,
are continuous at the point $(\mu_1', \mu_2', \mu_3')$, where the $\mu_i'$'s are the
population moments, and that $\lambda^* > 0$, $\mu^* > 0$, $0 < p^* < 1$ if
$(m_1', m_2', m_3')$ is sufficiently close to $(\mu_1', \mu_2', \mu_3')$.

(3) If $\lambda = \mu = \nu$, the behavior of the estimators changes radically.
Some discussion of the variances of the three estimators is of course
in order. It seems that the calculation of these variances, even in
asymptotic form, would not only be a difficult task but would lead to
somewhat complicated expressions. However, to simplify matters and to
give some idea of the reliability of the estimators $\lambda^*$ and $\mu^*$, it
will be temporarily assumed that $p$ is known. With this assumption,
only two sample moments are needed to estimate $\lambda$ and $\mu$.

(4) From equations (2.5.2) and (2.5.3) it is found that

where, as usual, $q = 1-p$ and $\mu^*$ is equal to the same expression with
a minus sign between the two terms on the right-hand side.

(5) The plus sign is used if $\lambda > \mu$, the negative sign is used if
$\lambda < \mu$. The first pair of estimators is consistent when $\lambda > \mu$, the
second pair when $\lambda < \mu$. Thus, here the estimators are consistent if
$\lambda = \mu$, but in this case the rate of approach to the limit is $n^{-1/4}$ as
compared with $n^{-1/2}$ for $\lambda \ne \mu$. Also, if $\lambda = \mu$ the probability that
the estimators are real does not approach 1 as $n$ approaches infinity,
although the imaginary parts converge to zero in probability. It is
assumed that $\lambda \ne \mu$. If it is not known whether $\lambda > \mu$ or $\lambda < \mu$,
then it is also not known which pair of estimators is consistent, that
is, which pair may be expected to be close to the true value when the
sample is large. (Rider [25].)

2.6 Illustrative Example

E. L. Grant [11] has given the example, shown in Table I below,
of the distribution of numbers of surface defects in 2,000 pieces of
enameled ware. The distribution is composed of 1,000 pieces from a
Poisson distribution having $\lambda = 0.5$ and 1,000 pieces from a Poisson
distribution having $\mu = 1.05$.


TABLE I

Number of defects     0     1     2     3     4     5     6     7    TOTAL
Frequency           830   638   327   137    49    15     3     1    2,000
Simple calculations yield:

$m_1' = 1$, $m_2' = 2.2475$, $m_3' = 6.4775$.

From (2.5.10) the following is obtained:

$S^* = 1.9697$

and from (2.5.11) or (2.5.8),

$P^* = 0.7222$.

It follows that

$\lambda^* = 0.4948$; $\mu^* = 1.4715$
$p^* = 0.4844$; $(1-p^*) = 0.5156$

2.7  Asymptotic  Variances  of  the  Poisson  Estimators 

Rider [26] discusses the asymptotic variances of the Poisson
estimators, which are given below.

For purposes of simplification, it will be assumed that the
proportion $p$ in the mixed-Poisson distribution is known. Then $\lambda^*$
and $\mu^*$ can be obtained by using only the first two sample moments.

It is readily found that, taking the case $\lambda > \mu$ and writing $q = 1-p$,

\[ \lambda^* = m_1' + (q/p)^{1/2}\,(m_2' - m_1' - m_1'^{2})^{1/2}, \qquad \mu^* = m_1' - (p/q)^{1/2}\,(m_2' - m_1' - m_1'^{2})^{1/2} . \tag{2.7.1} \]

The asymptotic variances of $\lambda^*$ and $\mu^*$ may be found by the
use of a formula given by Cramér [10].

In the notation of this thesis, this formula is

\[ \operatorname{var}(\lambda^*) = \mu_2(m_1')\left(\frac{\partial\lambda^*}{\partial m_1'}\right)^{2} + 2\,\mu_{11}(m_1', m_2')\,\frac{\partial\lambda^*}{\partial m_1'}\,\frac{\partial\lambda^*}{\partial m_2'} + \mu_2(m_2')\left(\frac{\partial\lambda^*}{\partial m_2'}\right)^{2} . \tag{2.7.2} \]

Here $\mu_2(m_1')$ and $\mu_2(m_2')$ are the variances of $m_1'$ and $m_2'$ respectively,
$\mu_{11}(m_1', m_2')$ is the covariance of these two moments, and the partial
derivatives are to be evaluated at the point

\[ m_1' = p\lambda + q\mu \]

\[ m_2' = p(\lambda + \lambda^{2}) + q(\mu + \mu^{2}) \tag{2.7.3} \]

\[ m_3' = p(\lambda + 3\lambda^{2} + \lambda^{3}) + q(\mu + 3\mu^{2} + \mu^{3}) \]

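The values of the coefficients of the partial derivatives in
(2.7.2) can be obtained by using a formula given by Kendall [18]. It
is found that:

\[ \mu_2(m_1') = n^{-1}(\mu_2' - \mu_1'^{2}) = n^{-1}[\,p\lambda + q\mu + pq(\lambda-\mu)^{2}\,], \tag{2.7.4} \]

where $p + q = 1$,

\[ \mu_{11}(m_1', m_2') = n^{-1}(\mu_3' - \mu_1'\mu_2') = n^{-1}[\,p\lambda + q\mu + p(3-p)\lambda^{2} - 2pq\lambda\mu + q(3-q)\mu^{2} + pq(\lambda-\mu)(\lambda^{2}-\mu^{2})\,], \tag{2.7.5} \]

\[ \mu_2(m_2') = n^{-1}(\mu_4' - \mu_2'^{2}) = n^{-1}[\,p\lambda + q\mu + p(7-p)\lambda^{2} - 2pq\lambda\mu + q(7-q)\mu^{2} + 2p(3-p)\lambda^{3} - 2pq\lambda^{2}\mu - 2pq\lambda\mu^{2} + 2q(3-q)\mu^{3} + pq(\lambda^{2}-\mu^{2})^{2}\,] . \tag{2.7.6} \]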
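The partial derivatives needed are:

\[ \frac{\partial\lambda^*}{\partial m_1'} = 1 - \frac{q^{1/2}(1 + 2m_1')}{2p^{1/2}(m_2' - m_1' - m_1'^{2})^{1/2}}, \tag{2.7.7} \]

\[ \frac{\partial\lambda^*}{\partial m_2'} = \frac{q^{1/2}}{2p^{1/2}(m_2' - m_1' - m_1'^{2})^{1/2}} . \tag{2.7.8} \]

At the point (2.7.3) these derivatives have the values

\[ \frac{\partial\lambda^*}{\partial m_1'} = -\frac{1 + 2\mu}{2p(\lambda-\mu)}, \qquad \frac{\partial\lambda^*}{\partial m_2'} = \frac{1}{2p(\lambda-\mu)} . \tag{2.7.9} \]

Substituting from (2.7.4), (2.7.5), (2.7.6) and (2.7.9) into (2.7.2)
and simplifying give, for the asymptotic variance of $\lambda^*$,

\[ [\,4np^{2}(\lambda-\mu)^{2}\,]^{-1}\,[\,2(p\lambda^{2} + q\mu^{2}) + 4p\lambda(\lambda-\mu)^{2} + pq(\lambda-\mu)^{4}\,] . \tag{2.7.10} \]

The asymptotic variance of $\mu^*$ can be obtained by interchanging $p$
and $q$, $\lambda$ and $\mu$ in (2.7.10).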
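The substitution described above can be carried out numerically. The sketch below evaluates the Cramér formula (2.7.2) for the case of known $p$ and $\lambda > \mu$, using the exact first four moments about the origin of the mixture and the partial derivatives at the population point. The function names are illustrative, and the fourth Poisson moment $\theta + 7\theta^2 + 6\theta^3 + \theta^4$ is a standard result that is assumed here rather than taken from the text.

```python
def mixture_raw_moments4(p, lam, mu):
    """First four moments about the origin of the mixed-Poisson distribution."""
    def poisson_raw(t):
        # Moments about the origin of a Poisson(t) variable.
        return (t, t + t**2, t + 3 * t**2 + t**3,
                t + 7 * t**2 + 6 * t**3 + t**4)
    q = 1 - p
    return tuple(p * a + q * b for a, b in zip(poisson_raw(lam), poisson_raw(mu)))


def asymptotic_var_lambda(p, lam, mu, n):
    """Delta-method variance of lambda* when p is known and lam > mu."""
    if lam <= mu:
        raise ValueError("this sketch assumes lam > mu")
    u1, u2, u3, u4 = mixture_raw_moments4(p, lam, mu)
    var_m1 = (u2 - u1**2) / n          # variance of the first sample moment
    cov_m1_m2 = (u3 - u1 * u2) / n     # covariance of the first two sample moments
    var_m2 = (u4 - u2**2) / n          # variance of the second sample moment
    d = 2 * p * (lam - mu)
    d1 = 1 - (1 + 2 * u1) / d          # d(lambda*)/d(m1') at the population point
    d2 = 1 / d                         # d(lambda*)/d(m2') at the population point
    return var_m1 * d1**2 + 2 * cov_m1_m2 * d1 * d2 + var_m2 * d2**2
```

Because the result scales as $1/n$, evaluating it once with `n=1` gives the constant that multiplies $1/n$ for any sample size.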


It is the personal opinion of Rider [25] that data should not
be assumed to have come from a mixed-Poisson distribution until it has
been determined that they have not come from a simple Poisson distribution.
That is, the parameter $\lambda$ of the distribution should be estimated,
following which a chi-square test should be made to see whether the
data conform to this distribution. If the hypothesis that they came
from a simple Poisson is rejected, a mixed-Poisson population may be
assumed.

Of course, the chi-square test may give the wrong conclusion,
in which case it should be impossible to find, by the method under
discussion, an estimate of $\lambda$. Even if the population is mixed and
$\lambda$ and $\mu$ are nearly equal, it might be difficult to obtain valid
estimates of them.
CHAPTER III

ASYMPTOTIC BEHAVIOR OF THE MOMENT ESTIMATORS

3.0 Introduction

Blischke [3] has discussed the construction of moment estimators
for the parameters of a mixture of two binomial distributions. He also
computed the limiting distributions of the estimators and their asymptotic
efficiency. In this chapter, the construction of moment estimators for a
mixed-Poisson distribution is given and their asymptotic properties are
discussed. The construction presented here parallels that of Blischke.

3.1 Construction of Moment Estimators for a Mixture of Two Poisson
Distributions

Let $X_1, X_2, \ldots, X_n$ be independent and identically distributed
random variables each having the distribution

\[ P(X = x) = \begin{cases} p\,\dfrac{e^{-\lambda}\lambda^{x}}{x!} + (1-p)\,\dfrac{e^{-\mu}\mu^{x}}{x!} & \text{if } x = 0, 1, 2, \ldots \\[1ex] 0 & \text{otherwise} \end{cases} \qquad (0 < p < 1;\ \lambda, \mu > 0), \tag{3.1.1} \]

where without any loss of generality we let $\lambda < \mu$. Equation (3.1.1)
is a mixture of two Poisson distributions. The estimators for the
parameters $\lambda$, $\mu$, and $p$ may be constructed as functions of sample
factorial moments. It should be noted that in the following construction,
as in the example considered by Rider [25], the assumption $\lambda < \mu$ is
essential. The estimators do not have the stated properties if $\lambda = \mu$.
(Note that it is actually required only that $\lambda \ne \mu$; the particular
assumption that $\lambda < \mu$ is for notational convenience.)

The $k$th factorial moment of $X$ can be written as

\[ m_{[k]} = p\lambda^{k} + (1-p)\mu^{k} = p(\lambda^{k} - \mu^{k}) + \mu^{k} . \tag{3.1.2} \]

To obtain estimators for the three parameters $\lambda$, $\mu$, and $p$ we
equate estimates $f_1$, $f_2$, and $f_3$ of the first three factorial
moments $m_{[1]}$, $m_{[2]}$, and $m_{[3]}$ respectively, of a random sample from
a distribution with probability function (3.1.1), to the corresponding
moments (3.1.2). Let $\hat{p}$, $\hat{\lambda}$, $\hat{\mu}$ be the estimators of $p$, $\lambda$, $\mu$
respectively. Thus we have

\[ f_1 = \hat{p}(\hat{\lambda} - \hat{\mu}) + \hat{\mu}, \]

\[ f_2 = \hat{p}(\hat{\lambda}^{2} - \hat{\mu}^{2}) + \hat{\mu}^{2}, \]

\[ f_3 = \hat{p}(\hat{\lambda}^{3} - \hat{\mu}^{3}) + \hat{\mu}^{3}, \]
where

\[ f_k = \sum_{x=0}^{R} x(x-1)\cdots(x-k+1)\,\frac{n_x}{n}, \]

where $R$ is the largest observed value of $x$, $n_x$ is the sample
frequency of $x$, and $n$ is the total sample size,

\[ n = \sum_{x=0}^{R} n_x . \]

Now

\[ f_k = \frac{1}{n}\sum_{i=1}^{n} X_i(X_i - 1)\cdots(X_i - k + 1) . \]

It is easily seen in fact that

\[ f_k = \hat{p}\,\hat{\lambda}^{k} + (1-\hat{p})\,\hat{\mu}^{k} . \tag{3.1.3} \]

In constructing estimators, the moments given in (3.1.3) are considered
as equations in the three unknowns $\hat{p}$, $\hat{\lambda}$, $\hat{\mu}$. Any three such
equations may be solved for the three parameters. We shall consider $f_1$,
$f_2$, and $f_3$. Note that by equation (3.1.3),
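\[ f_2 - f_1^{2} = \hat{p}(1-\hat{p})(\hat{\lambda} - \hat{\mu})^{2}, \qquad f_3 - f_1 f_2 = \hat{p}(1-\hat{p})(\hat{\lambda} - \hat{\mu})^{2}(\hat{\lambda} + \hat{\mu}), \]

so that

\[ \frac{f_3 - f_1 f_2}{f_2 - f_1^{2}} = \hat{\lambda} + \hat{\mu} = S^*, \text{ say}. \]

Now from $f_1$,

\[ \hat{p} = \frac{f_1 - \hat{\mu}}{\hat{\lambda} - \hat{\mu}}, \tag{3.1.4} \]

which, when substituted into $f_2$, yields

\[ f_2 = (f_1 - \hat{\mu})\,S^* + \hat{\mu}^{2} . \]

Thus

\[ \hat{\mu}^{2} - S^*\hat{\mu} + S^* f_1 - f_2 = 0 . \tag{3.1.5} \]

Solving for $\hat{\lambda}$ instead of $\hat{\mu}$ yields

\[ \hat{\lambda}^{2} - S^*\hat{\lambda} + S^* f_1 - f_2 = 0 . \]

Thus the restriction that $\lambda < \mu$ results in the unique solution

\[ \hat{\lambda} = \tfrac{1}{2}S^* - \tfrac{1}{2}[\,S^{*2} - 4S^*f_1 + 4f_2\,]^{1/2}, \qquad \hat{\mu} = \tfrac{1}{2}S^* + \tfrac{1}{2}[\,S^{*2} - 4S^*f_1 + 4f_2\,]^{1/2}, \tag{3.1.6} \]

which together with equation (3.1.4) expresses $\hat{\lambda}$, $\hat{\mu}$, and $\hat{p}$ as
functions of $f_1$, $f_2$, and $f_3$.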
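The construction of this section can be sketched as follows: compute the sample factorial moments, form $S^*$, solve the quadratic for the two rates under the convention $\lambda < \mu$, and recover the proportion from (3.1.4). This is an illustrative sketch only; the function names are assumptions, and the error handling is one possible choice rather than part of the construction.

```python
import math


def falling_factorial(x, k):
    """x(x-1)...(x-k+1)."""
    result = 1
    for j in range(k):
        result *= (x - j)
    return result


def sample_factorial_moment(sample, k):
    """k-th sample factorial moment: mean of x(x-1)...(x-k+1) over the sample."""
    return sum(falling_factorial(x, k) for x in sample) / len(sample)


def factorial_moment_estimates(f1, f2, f3):
    """Estimates (lam, mu, p) with lam < mu from the first three factorial moments."""
    spread = f2 - f1**2
    if spread == 0:
        raise ValueError("f2 - f1^2 is zero; S* is undefined")
    S = (f3 - f1 * f2) / spread
    disc = S**2 - 4 * S * f1 + 4 * f2
    if disc <= 0:
        raise ValueError("no pair of distinct real roots")
    root = math.sqrt(disc)
    lam = 0.5 * (S - root)
    mu = 0.5 * (S + root)
    p = (f1 - mu) / (lam - mu)         # as in (3.1.4)
    return lam, mu, p
```

In practice one would pass `sample_factorial_moment(data, k)` for `k = 1, 2, 3`; with exact population factorial moments the function returns the generating parameters.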
3.2 Asymptotic Behavior of the Moment Estimators

The asymptotic joint distribution of $\lambda^*$, $\mu^*$, $p^*$ is obtained
by an extension of a result given by Cramér [10]. Cramér proves a
theorem concerning the limiting distribution of a function of two central
moments. Extended to several functions and to functions of factorial
moments, the result is the following:

THEOREM: Let $H_i(F_{j_1}, \ldots, F_{j_k})$, $i = 1, \ldots, r$, be $r$ functions of
$k$ sample factorial moments, continuous in some neighbourhood of
$(F_{j_1}, \ldots, F_{j_k}) = (f_{j_1}, \ldots, f_{j_k}) = f$, say, and having continuous first
and second order partial derivatives with respect to the arguments
$F_{j_1}, \ldots, F_{j_k}$. Then the random variables $H_1, \ldots, H_r$ are asymptotically
jointly normally distributed with respective means
$H_1(f_{j_1}, \ldots, f_{j_k}), \ldots, H_r(f_{j_1}, \ldots, f_{j_k})$ and covariance matrix
$\Sigma = (\sigma_{il})$ having entries

\[ \sigma_{il} = \sum_{s=1}^{k}\sum_{t=1}^{k} \operatorname{cov}(F_{j_s}, F_{j_t}) \left.\frac{\partial H_i}{\partial F_{j_s}}\,\frac{\partial H_l}{\partial F_{j_t}}\right|_{F = f} . \]

Cramér proves the theorem for central moments and for $r = 1$. For
$r = 1$, the argument given by Cramér applies directly in the case of
factorial moments since the convergence properties of central moments
required in the proof hold also for factorial moments. The extension to
$r > 1$ is obvious in either case.

The conditions of the theorem are easily seen to hold for
$H_1(F_1, F_2, F_3) = \lambda^*$, $H_2(F_1, F_2, F_3) = \mu^*$ and $H_3(F_1, F_2, F_3) = p^*$.
Thus $\lambda^*$, $\mu^*$, $p^*$ are asymptotically jointly normally distributed with
means $\lambda$, $\mu$, and $p$ respectively. To compute the covariance matrix
of the asymptotic normal distribution, the covariance matrix of
$(F_1, F_2, F_3)$ and the first order partial derivatives $\partial H_i/\partial F_j$ evaluated
at $f = (f_1, f_2, f_3)$ are required for $i, j = 1, 2, 3$.

The moment generating function of the distribution [Equation
(3.1.1)] is

\[ m(t) = p\,e^{-\lambda(1-e^{t})} + (1-p)\,e^{-\mu(1-e^{t})} . \tag{3.2.1} \]

From equation (3.2.1) the necessary moments of the $F_i$ are obtained.
By equation (3.1.3) these can be written as
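\[ \mu_2(F_1) = \frac{1}{mn}\,[\,f_1 + (n-1)f_2\,] - \frac{1}{m}(f_1)^{2} \]

\[ \mu_2(F_2) = \frac{1}{mn(n-1)}\,[\,2f_2 + 4(n-2)f_3 + (n-2)(n-3)f_4\,] - \frac{1}{m}(f_2)^{2} \]

\[ \mu_2(F_3) = \frac{1}{mn(n-1)(n-2)}\,[\,6f_3 + 18(n-3)f_4 + 9(n-3)(n-4)f_5 + (n-3)(n-4)(n-5)f_6\,] - \frac{1}{m}(f_3)^{2} \]

\[ \mu_{11}(F_1, F_2) = \frac{1}{mn}\,[\,2f_2 + (n-2)f_3\,] - \frac{1}{m}f_1 f_2 \]

\[ \mu_{11}(F_1, F_3) = \frac{1}{mn}\,[\,3f_3 + (n-3)f_4\,] - \frac{1}{m}f_1 f_3 \]

\[ \mu_{11}(F_2, F_3) = \frac{1}{mn(n-1)}\,[\,6f_3 + 6(n-3)f_4 + (n-3)(n-4)f_5\,] - \frac{1}{m}f_2 f_3 \]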

The required partial derivatives are not difficult to compute:

In matrix form the derivatives are

\[ \cdots + \frac{\cdots}{nm}\,[\,p\lambda^{2} + (1-p)\mu^{2}\,] - \cdots\,[\,p\lambda + (1-p)\mu\,]^{2} \cdots \]

\[ \cdots + \cdots\,[\,p\lambda^{4} + (1-p)\mu^{4}\,] + \cdots\,[\,p\lambda^{5} + (1-p)\mu^{5}\,] - \cdots\,[\,p\lambda^{2} + (1-p)\mu^{2}\,][\,p\lambda^{3} + (1-p)\mu^{3}\,] \cdots \]

To  simplify  this  expression,  write 

(n-2)(n-3) = n(n-1) - 4(n-1) + 2

(n-3)(n-4) = (n-1)(n-2) - 4(n-2) + 2

           = n(n-1) - 6(n-1) + 6

(n-3)(n-4)(n-5) = n(n-1)(n-2) - 9(n-1)(n-2) + 18(n-2) - 6 ,
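These polynomial identities in $n$ are easy to confirm mechanically. The following short sketch checks each one over a range of integers; the function name and the range tested are arbitrary choices made here for illustration.

```python
def identities_hold(n):
    """Return True if the four rewriting identities hold for this integer n."""
    first = (n - 2) * (n - 3) == n * (n - 1) - 4 * (n - 1) + 2
    second = (n - 3) * (n - 4) == (n - 1) * (n - 2) - 4 * (n - 2) + 2
    third = (n - 3) * (n - 4) == n * (n - 1) - 6 * (n - 1) + 6
    fourth = ((n - 3) * (n - 4) * (n - 5)
              == n * (n - 1) * (n - 2) - 9 * (n - 1) * (n - 2) + 18 * (n - 2) - 6)
    return first and second and third and fourth


all_ok = all(identities_hold(n) for n in range(-20, 21))
```

Since each side is a polynomial of degree at most three, agreement at more than three distinct integers already implies the identity for all $n$.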

etc., and collect terms having common denominators. For example, the
terms independent of $n$ (ignoring the common factor $[mp^{2}(\lambda-\mu)^{4}]^{-1}$)
sum to

\[ \{p\lambda^{2} + (1-p)\mu^{2} - [p\lambda + (1-p)\mu]^{2}\}\,\mu^{2}(\mu + 2\lambda)^{2} \]

\[ {} + \{p\lambda^{4} + (1-p)\mu^{4} - [p\lambda^{2} + (1-p)\mu^{2}]^{2}\}\,(\lambda + 2\mu)^{2} \]

\[ {} + p\lambda^{6} + (1-p)\mu^{6} - [p\lambda^{3} + (1-p)\mu^{3}]^{2} ; \]

of the terms that remain, those having $[mnp^{2}(\lambda-\mu)^{4}]^{-1}$ as a factor sum
to  pA(l-A)  (A-p)^  ,  etc. 


After some computation, the elements of the asymptotic
covariance matrix of $\lambda^*$, $\mu^*$, $p^*$ are in this way found to be

3.3 Asymptotic Relative Efficiency of the Moment Estimators


The asymptotic relative efficiency (ARE) of a consistent
asymptotically normally distributed moment estimator $\hat{\theta}$ of a parameter
$\theta$ relative to the corresponding maximum likelihood estimator, say $\tilde{\theta}$,
may be computed as (Cramér [10])

\[ \text{ARE} = \sigma_0^{2} / \sigma_\theta^{2}, \]

where $\sigma_0^{2}/m$ is the Cramér-Rao lower bound and $\sigma_\theta^{2}/m$ is the variance
in the limiting distribution of $\hat{\theta}$. If $\hat{\theta} = (\hat{\theta}_1, \ldots, \hat{\theta}_r)$ is an estimator
of a vector parameter $\theta = (\theta_1, \ldots, \theta_r)$ with the components of $\hat{\theta}$
asymptotically jointly normally distributed with mean $\theta$ and covariance
matrix $\frac{1}{m}\Sigma_\theta$, the asymptotic relative efficiency may be computed in
this way for each component of $\hat{\theta}$, or the components may be considered
jointly. In this latter case, the joint asymptotic relative efficiency
(JARE) of $\hat{\theta}$ relative to the maximum likelihood estimator $\tilde{\theta}$ is computed
as the square of the ratio of the areas of the ellipses of concentration of
the respective asymptotic normal distributions (Cramér [10], Ch. 32).
Since the squared areas of the ellipses of concentration are proportional to
the determinants of the respective covariance matrices, the JARE is the ratio
of the determinant of the Cramér-Rao lower bound covariance matrix to that of
the asymptotic covariance matrix of the moment estimators.
The Cramér-Rao lower bound is determined by the relation

\[ (\Sigma_0^{-1})_{ij} = E\left[ \frac{\partial \log P(x)}{\partial\theta_i}\,\frac{\partial \log P(x)}{\partial\theta_j} \right], \tag{3.3.2} \]

where $P(x)$ is given in equation (3.1.1). The elements of the inverse
of $\Sigma_0$ for the example under consideration are easily
computed, e.g., the upper left-hand element of $\Sigma_0^{-1}$ is

Equation (3.3.6) is the Cramér-Rao lower bound covariance matrix for
estimating the parameters of a mixture of two Poisson distributions.

3.4  Estimation  of  the  Poisson  Parameters  When  p  (proportion)  is  known 

We again assume that $\lambda < \mu$. Note that in Section 3.1 this
assumption was not restrictive. In the present case, however, this can
be a very real restriction. Unless $p = 1/2$, it is not sufficient
when $p$ is known to know simply that $\lambda \ne \mu$; it must be known specifically
that $p$ is the proportion in which the population having the smaller mean
is present in the mixture (see Rider's comment [25] on this problem).

Under this assumption a development similar to that of Section
3.1 yields, as functions of the first two factorial moments, the moment
estimators


(3.4.1)


These estimators are efficient for $n = 2$. We shall now show, however,
that the asymptotic relative efficiencies of the estimators of equation
(3.4.1) tend to 0 rather than to 1 as $n \to \infty$.

So that by Cramér's theorem,

In this case, the Cramér-Rao lower bound covariance matrix for estimating
the parameters of a mixed-Poisson distribution is
3.5 Mixtures of Three or More Poisson Distributions


Introductory Remarks

We shall now consider the estimation problem for a mixture of
an arbitrary number, $r$, of Poisson distributions. Let $X_1, \ldots, X_m$
be independent and identically distributed random variables, each
having the distribution (Blischke [4])

all parameters (except $n$) unknown. Equation (3.5.1) is a mixture
of $r$ Poisson distributions. In this case there are $(2r-1)$ parameters
to estimate. Estimators for these $(2r-1)$ parameters can be constructed
by the methods of Section 3.1 as functions of the sample factorial
moments $F_1, F_2, \ldots, F_{2r-1}$, defined in equation (3.1.2). In this case

The verification of equation (3.5.2) is identical with that of
equation (3.1.3). Thus the moment estimators are constructed by solving
for $\lambda_1, \ldots, \lambda_r$, $p_1, \ldots, p_{r-1}$ and substituting $F_k$'s
for $f_k$'s in the solution. We shall show that for $r > 4$ this operation
will result in a polynomial in one of the parameters of degree five or
more. Such a polynomial cannot be solved algebraically, so that the
estimators cannot be written explicitly but only as roots of certain
polynomials.


Construction  of  the  Moment  Estimators 


To  illustrate  further  the  method  of  attack  required  in  computing 

estimators  for  mixtures  of  Poissons,  we  shall  now  outline  the  construction 
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$\lambda_1, \lambda_2, \lambda_3, p_1$, and $p_2$ is a proof of identifiability for $n > 2r-1 = 5$.)
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yields 




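$$
\begin{aligned}
f_3 &= p_1(\lambda_1^3 - \lambda_3^3) + p_2(\lambda_2^3 - \lambda_3^3) + \lambda_3^3 \\
&= \frac{1}{\lambda_1 - \lambda_2}\Big[(f_2 - \lambda_2 f_1 - \lambda_3 f_1 + \lambda_2\lambda_3)(\lambda_1^2 + \lambda_1\lambda_3 + \lambda_3^2) \\
&\qquad\qquad - (f_2 - \lambda_1 f_1 - \lambda_3 f_1 + \lambda_1\lambda_3)(\lambda_2^2 + \lambda_2\lambda_3 + \lambda_3^2)\Big] + \lambda_3^3 \\
&= (f_2 - \lambda_3 f_1)(\lambda_1 + \lambda_2 + \lambda_3) - \lambda_1\lambda_2(f_1 - \lambda_3) + \lambda_3^2(f_1 - \lambda_3) + \lambda_3^3 \\
&= (\lambda_1 + \lambda_2 + \lambda_3) f_2 - (\lambda_1\lambda_2 + \lambda_1\lambda_3 + \lambda_2\lambda_3) f_1 + \lambda_1\lambda_2\lambda_3
\end{aligned}
\tag{3.5.5}
$$

Similarly,

$$
\begin{aligned}
f_4 &= p_1(\lambda_1^4 - \lambda_3^4) + p_2(\lambda_2^4 - \lambda_3^4) + \lambda_3^4 \\
&= \frac{1}{\lambda_1 - \lambda_2}\Big[(f_2 - \lambda_2 f_1 - \lambda_3 f_1 + \lambda_2\lambda_3)(\lambda_1^3 + \lambda_1^2\lambda_3 + \lambda_1\lambda_3^2 + \lambda_3^3) \\
&\qquad\qquad - (f_2 - \lambda_1 f_1 - \lambda_3 f_1 + \lambda_1\lambda_3)(\lambda_2^3 + \lambda_2^2\lambda_3 + \lambda_2\lambda_3^2 + \lambda_3^3)\Big] + \lambda_3^4 \\
&= (\lambda_1 + \lambda_2 + \lambda_3) f_3 - (\lambda_1\lambda_2 + \lambda_1\lambda_3 + \lambda_2\lambda_3) f_2 + \lambda_1\lambda_2\lambda_3 f_1
\end{aligned}
\tag{3.5.6}
$$

From equation (3.5.5),

$$
\lambda_1 = \frac{f_3 - \lambda_2 f_2 - \lambda_3 f_2 + \lambda_2\lambda_3 f_1}{f_2 - \lambda_2 f_1 - \lambda_3 f_1 + \lambda_2\lambda_3}
$$

Substituting this into equation (3.5.6) and clearing the denominator, we obtain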




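$$
(f_4 - \lambda_2 f_3 - \lambda_3 f_3 + \lambda_2\lambda_3 f_2)(f_2 - \lambda_2 f_1 - \lambda_3 f_1 + \lambda_2\lambda_3)
= (f_3 - \lambda_2 f_2 - \lambda_3 f_2 + \lambda_2\lambda_3 f_1)^2
$$

or

$$
\begin{aligned}
0 &= (f_1 f_3 - \lambda_3 f_3 + \lambda_3^2 f_2 - f_2^2 - \lambda_3^2 f_1^2 + \lambda_3 f_1 f_2)\,\lambda_2^2 \\
&\quad + (\lambda_3 f_4 - \lambda_3 f_2^2 - \lambda_3^2 f_3 + \lambda_3^2 f_1 f_2 + f_2 f_3 - f_1 f_4)\,\lambda_2 \\
&\quad + (f_2 f_4 - f_3^2 + \lambda_3 f_2 f_3 - \lambda_3 f_1 f_4 + \lambda_3^2 f_1 f_3 - \lambda_3^2 f_2^2) \\
&= Q_1(\lambda_3)\,\lambda_2^2 + Q_2(\lambda_3)\,\lambda_2 + Q_3(\lambda_3)
\end{aligned}
\tag{3.5.7}
$$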


say. As in the solution for two Poissons, elimination of $\lambda_2$ from
equations (3.5.5) and (3.5.6) will result in equation (3.5.7) with $\lambda_1$
replacing $\lambda_2$. Thus the two roots of the equation
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$f_5 = p_1(\lambda_1^5 - \lambda_3^5) + p_2(\lambda_2^5 - \lambda_3^5) + \lambda_3^5$
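
As a numerical check on the elimination leading to equation (3.5.7), the following Python sketch builds the coefficients $Q_1$, $Q_2$, $Q_3$ from population factorial moments of a three-component mixture. It confirms that, when the true $\lambda_3$ is supplied, the two roots of the quadratic are $\lambda_1$ and $\lambda_2$. The function names, the parameter values, and the shorthand $g_k = f_k - \lambda_3 f_{k-1}$ (with $f_0 = 1$) are illustrative assumptions; the shorthand is only a compact way of writing the expanded coefficients.

```python
import math

def factorial_moments(p, lam, kmax):
    """Return [f_0, ..., f_kmax] with f_k = sum_i p_i * lam_i**k (so f_0 = 1)."""
    return [sum(pi * li ** k for pi, li in zip(p, lam)) for k in range(kmax + 1)]

def quadratic_coefficients(f, lam3):
    """Coefficients Q1, Q2, Q3 of (3.5.7), using g_k = f_k - lam3 * f_{k-1}."""
    g = {k: f[k] - lam3 * f[k - 1] for k in range(1, 5)}
    q1 = g[1] * g[3] - g[2] ** 2
    q2 = g[2] * g[3] - g[1] * g[4]
    q3 = g[2] * g[4] - g[3] ** 2
    return q1, q2, q3

def quadratic_roots(q1, q2, q3):
    """Real roots of q1*s**2 + q2*s + q3 = 0, in increasing order."""
    disc = q2 * q2 - 4.0 * q1 * q3
    if disc < 0:
        raise ValueError("no real roots for the supplied moments")
    root = math.sqrt(disc)
    return sorted([(-q2 - root) / (2.0 * q1), (-q2 + root) / (2.0 * q1)])

# Hypothetical mixture used only for this check.
p = (0.2, 0.3, 0.5)
lam = (0.5, 1.5, 3.0)
f = factorial_moments(p, lam, 4)
roots = quadratic_roots(*quadratic_coefficients(f, lam[2]))
print(roots)
```

In practice $\lambda_3$ is not known; the sketch only illustrates that, for a fixed $\lambda_3$, the remaining two parameters enter symmetrically as the roots of one quadratic.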
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CHAPTER  IV 

MIXED  TRUNCATED  POISSON  DISTRIBUTIONS 
WITH  MISSING  ZERO  CLASSES 

4.0  Introduction 

The  first  three  sample  moments  were  employed  by  Rider  [26] 
in  obtaining  explicit  estimators  for  parameters  of  mixtures  of  two 
Poisson distributions. In 1963, Rider's results were simplified and extended by
Cohen [9]. The simplification is achieved through use of factorial
rather than ordinary moments. The extensions involve consideration
of alternative estimators for the case considered by Rider [26], using
the  first  two  sample  moments  plus  the  zero  sample  frequency;  mixtures 
of  truncated  Poisson  distributions;  and  some  factors  involved  in 
deciding  when  sample  data  are  indeed  from  a  mixed  rather  than  from 
a  non-mixed  distribution  (Cohen  [9]). 

4.1  Estimates  Based  on  First  Two  Sample  Moments  and  Zero  Sample 
Frequency 

Let $\lambda$ and $\mu$ be the parameters of the mixed-Poisson
distribution that have been mixed in the proportions $p$ and $(1-p)$
respectively. The probability function of the mixed distribution is
then

$$
g(x) = p\,\frac{e^{-\lambda}\lambda^x}{x!} + (1-p)\,\frac{e^{-\mu}\mu^x}{x!}, \qquad x = 0, 1, 2, \ldots \tag{4.1.1}
$$
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where, without any loss of generality, we let $\lambda > \mu$. The $k$th factorial
moment of $x$ can be written as

$$
m_{[k]} = p\lambda^k + (1-p)\mu^k
$$

The  estimating  equations  are 


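$$
\begin{aligned}
\bar{x} - \mu &= p(\lambda - \mu) \\
\nu_{[2]} - \mu^2 &= p(\lambda^2 - \mu^2) \\
\frac{n_0}{n} - e^{-\mu} &= p\left[e^{-\lambda} - e^{-\mu}\right]
\end{aligned}
\tag{4.1.2}
$$

where

$$
\nu_{[k]} = \sum_{x=0}^{R} x(x-1)\cdots(x-k+1)\,\frac{n_x}{n} \tag{4.1.3}
$$

in which $R$ is the largest observed value of $x$, $n_x$ is the sample
frequency of $x$, $n_0$ is the number of zero observations in the sample and
$n$ is the total sample size; that is

$$
n = \sum_{x=0}^{R} n_x .
$$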


On  eliminating  p  between  the  first  two,  and  between  the  first  and 
third  equations  of  (4.1.2),  we  have 
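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
$$
\begin{aligned}
\lambda &= \frac{\nu_{[2]} - \bar{x}\mu}{\bar{x} - \mu} = G(\mu) \\
\frac{\bar{x} - \mu}{\lambda - \mu} &= \frac{\dfrac{n_0}{n} - e^{-\mu}}{e^{-\lambda} - e^{-\mu}}
\end{aligned}
\tag{4.1.4}
$$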


When $\lambda = G(\mu)$ is substituted into the second equation of (4.1.4), we
obtain

$$
\frac{\bar{x} - \mu}{G(\mu) - \mu} = \frac{\dfrac{n_0}{n} - e^{-\mu}}{e^{-G(\mu)} - e^{-\mu}} \tag{4.1.5}
$$

which can be solved for $\mu^*$ using standard iterative procedures. With
$\mu^*$ determined from (4.1.5), $\lambda^*$ follows from the first equation of
(4.1.4) as

$$
\lambda^* = \frac{\nu_{[2]} - \bar{x}\mu^*}{\bar{x} - \mu^*} \tag{4.1.6}
$$

and $p^*$ follows from the first equation of (4.1.2) as

$$
p^* = \frac{\bar{x} - \mu^*}{\lambda^* - \mu^*} \tag{4.1.7}
$$
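
The "standard iterative procedures" are not specified, so the following Python sketch uses plain bisection as one possible choice. It solves (4.1.5) for $\mu^*$ and then applies (4.1.6) and (4.1.7). The function name, the bracketing interval, and the use of the frequencies from Rider's example in this chapter are assumptions made for illustration; the bracket must be chosen so that the two sides of (4.1.5) cross inside it.

```python
import math

def zero_frequency_estimates(counts, lo, hi, iterations=200):
    """Estimate (lam, mu, p) from the mean, second factorial moment and zero frequency.

    counts[x] is the observed frequency n_x; [lo, hi] must bracket a root of (4.1.5).
    """
    n = sum(counts)
    xbar = sum(x * c for x, c in enumerate(counts)) / n
    v2 = sum(x * (x - 1) * c for x, c in enumerate(counts)) / n
    zero_freq = counts[0] / n

    def G(mu):
        # First equation of (4.1.4).
        return (v2 - xbar * mu) / (xbar - mu)

    def h(mu):
        # Left side minus right side of (4.1.5).
        lam = G(mu)
        lhs = (xbar - mu) / (lam - mu)
        rhs = (zero_freq - math.exp(-mu)) / (math.exp(-lam) - math.exp(-mu))
        return lhs - rhs

    f_lo = h(lo)
    if f_lo * h(hi) > 0:
        raise ValueError("interval does not bracket a root of (4.1.5)")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = h(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    mu = 0.5 * (lo + hi)
    lam = G(mu)                      # (4.1.6)
    p = (xbar - mu) / (lam - mu)     # (4.1.7)
    return lam, mu, p

# Frequencies n_0, ..., n_7 of Rider's example (n = 2000).
counts = [830, 638, 327, 137, 49, 15, 3, 1]
lam_hat, mu_hat, p_hat = zero_frequency_estimates(counts, 0.3, 0.5)
print(lam_hat, mu_hat, p_hat)
```

Bisection is slow but needs only a sign change. A Newton or secant iteration would converge faster but needs a reasonable starting value.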


4.2  Mixed  Truncated  Poisson  Distributions  with  Missing  Zero  Classes 

In this case, the probability function of the mixed distribution
is written as

$$
f(x) = p\,\frac{\lambda^x e^{-\lambda}}{x!\,(1-e^{-\lambda})} + (1-p)\,\frac{\mu^x e^{-\mu}}{x!\,(1-e^{-\mu})}, \qquad x = 1, 2, \ldots \tag{4.2.1}
$$
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The $k$th factorial moment is

$$
m_{[k]} = p\,\frac{\lambda^k}{1-e^{-\lambda}} + (1-p)\,\frac{\mu^k}{1-e^{-\mu}} \tag{4.2.2}
$$
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The  estimating  equations  resulting  from  equating  sample  moments  to 
corresponding  distribution  moments  (4.2.2),  in  this  case,  are 


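$$
\begin{aligned}
\bar{x} - \frac{\mu}{1-e^{-\mu}} &= p\left[\frac{\lambda}{1-e^{-\lambda}} - \frac{\mu}{1-e^{-\mu}}\right] \\
\nu_{[2]} - \frac{\mu^2}{1-e^{-\mu}} &= p\left[\frac{\lambda^2}{1-e^{-\lambda}} - \frac{\mu^2}{1-e^{-\mu}}\right] \\
\nu_{[3]} - \frac{\mu^3}{1-e^{-\mu}} &= p\left[\frac{\lambda^3}{1-e^{-\lambda}} - \frac{\mu^3}{1-e^{-\mu}}\right]
\end{aligned}
\tag{4.2.3}
$$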


If the first of the above equations is multiplied by $\mu$ and subtracted
from the second, and then the second is multiplied by $\mu$ and subtracted
from the third, the following equations are obtained:

$$
\begin{aligned}
\nu_{[2]} - \mu\bar{x} &= p\lambda(\lambda - \mu)/(1-e^{-\lambda}), \\
\nu_{[3]} - \mu\nu_{[2]} &= p\lambda^2(\lambda - \mu)/(1-e^{-\lambda})
\end{aligned}
\tag{4.2.4}
$$
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When the second equation of (4.2.4) is divided by the first, the result
is

$$
\lambda = \frac{\nu_{[3]} - \mu\nu_{[2]}}{\nu_{[2]} - \mu\bar{x}} = H(\mu) \tag{4.2.5}
$$


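Then elimination of $p$ between the first two equations of (4.2.3)
yields

$$
\frac{\bar{x} - \dfrac{\mu}{1-e^{-\mu}}}{\nu_{[2]} - \dfrac{\mu^2}{1-e^{-\mu}}}
= \frac{\dfrac{\lambda}{1-e^{-\lambda}} - \dfrac{\mu}{1-e^{-\mu}}}{\dfrac{\lambda^2}{1-e^{-\lambda}} - \dfrac{\mu^2}{1-e^{-\mu}}}
\tag{4.2.6}
$$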


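Finally, when $\lambda$ in (4.2.6) is replaced by $H(\mu)$, the result is the
following equation in $\mu$ only:

$$
\frac{\bar{x} - \dfrac{\mu}{1-e^{-\mu}}}{\nu_{[2]} - \dfrac{\mu^2}{1-e^{-\mu}}}
= \frac{\dfrac{H(\mu)}{1-e^{-H(\mu)}} - \dfrac{\mu}{1-e^{-\mu}}}{\dfrac{H^2(\mu)}{1-e^{-H(\mu)}} - \dfrac{\mu^2}{1-e^{-\mu}}}
\tag{4.2.7}
$$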
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Equation (4.2.7) can be solved for $\mu^*$ using standard iterative
procedures. With $\mu^*$ determined from (4.2.7), $\lambda^*$ may be obtained
from (4.2.5) as
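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and $p^*$ may be obtained from the first equation of (4.2.3) as

$$
p^* = \left[\bar{x} - \frac{\mu^*}{1-e^{-\mu^*}}\right] \Big/ \left[\frac{\lambda^*}{1-e^{-\lambda^*}} - \frac{\mu^*}{1-e^{-\mu^*}}\right] \tag{4.2.9}
$$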
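
The procedure of this section can be sketched in Python as follows. Bisection is used for (4.2.7) as one possible iterative procedure, followed by (4.2.5) for $\lambda^*$ and the first equation of (4.2.3) for $p^*$. The function names, the bracketing interval, and the test values $p = 0.5$, $\lambda = 2$, $\mu = 0.5$ are hypothetical. The "sample" moments fed in are the exact population moments (4.2.2) for those values, so the sketch only checks that the equations recover the parameters they were built from.

```python
import math

def truncated_moment(theta, k):
    """k-th factorial moment of a zero-truncated Poisson with parameter theta."""
    return theta ** k / (1.0 - math.exp(-theta))

def truncated_mixture_estimates(xbar, v2, v3, lo, hi, iterations=200):
    """Solve (4.2.7) for mu on [lo, hi], then recover lam and p."""
    def H(mu):
        # Equation (4.2.5).
        return (v3 - mu * v2) / (v2 - mu * xbar)

    def h(mu):
        # Left side minus right side of (4.2.7).
        lam = H(mu)
        lhs = (xbar - truncated_moment(mu, 1)) / (v2 - truncated_moment(mu, 2))
        rhs = ((truncated_moment(lam, 1) - truncated_moment(mu, 1))
               / (truncated_moment(lam, 2) - truncated_moment(mu, 2)))
        return lhs - rhs

    f_lo = h(lo)
    if f_lo * h(hi) > 0:
        raise ValueError("interval does not bracket a root of (4.2.7)")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = h(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    mu = 0.5 * (lo + hi)
    lam = H(mu)
    p = ((xbar - truncated_moment(mu, 1))
         / (truncated_moment(lam, 1) - truncated_moment(mu, 1)))
    return lam, mu, p

# Hypothetical parameters; exact moments from (4.2.2) stand in for sample moments.
true_p, true_lam, true_mu = 0.5, 2.0, 0.5
xbar, v2, v3 = (true_p * truncated_moment(true_lam, k)
                + (1 - true_p) * truncated_moment(true_mu, k) for k in (1, 2, 3))
lam_hat, mu_hat, p_hat = truncated_mixture_estimates(xbar, v2, v3, 0.3, 0.7)
print(lam_hat, mu_hat, p_hat)
```

With real data the bracket has to be found by inspection. The equation is symmetric in the two components, so a second root near the larger parameter is also to be expected.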


4.5  Which  Distribution  is  Applicable 

Whenever it is known in advance of selecting a sample that
one of the mixed distributions discussed in the preceding sections of
this chapter is applicable, then it is a relatively simple task to
calculate estimates using the appropriate estimating equations. In
the more general situations, it is necessary to determine from the
sample data which distribution most adequately accounts for the
observations at hand. The latter problem is more difficult to deal
with and much remains to be learned concerning its solution.

In this section, we consider the following illustrative
example employed by Rider [26].
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TABLE II

    x      n_x
    0      830
    1      638
    2      327
    3      137
    4       49
    5       15
    6        3
    7        1


After a casual examination of these data, one might be led
to suggest that they originated with an ordinary Poisson distribution,
whereas in fact they represent a mixture of equal proportions of two
Poisson distributions with

$\lambda = 1.5$ and $\mu = 0.5$

respectively. An indication of the extent to which a simple Poisson
fits these or similar data might be obtained by calculating the set of
estimates

(4.5.1)

and the set of estimates $\lambda^{**}$

for which
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The calculation of the $\lambda^*$ presents no difficulties, and $\lambda^{**}$ can be
read directly from the tables of the Poisson function. These estimates
for Rider's example are given in the table below.


Sample  moments  for  these  data  are 

$\bar{x} = 0.9995$;  $\nu_{[2]} = 1.248$;  $\nu_{[3]} = 1.734$
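
These three values can be reproduced directly from the frequencies of the example with definition (4.1.3). The short Python sketch below does so; the function name is an illustrative choice.

```python
def sample_factorial_moment(counts, k):
    """k-th sample factorial moment: sum over x of x(x-1)...(x-k+1) * n_x / n."""
    n = sum(counts)
    total = 0
    for x, n_x in enumerate(counts):
        falling = 1
        for j in range(k):
            falling *= (x - j)      # builds x(x-1)...(x-k+1)
        total += falling * n_x
    return total / n

# Frequencies n_0, ..., n_7 of Rider's example (n = 2000).
counts = [830, 638, 327, 137, 49, 15, 3, 1]
xbar = sample_factorial_moment(counts, 1)
v2 = sample_factorial_moment(counts, 2)
v3 = sample_factorial_moment(counts, 3)
print(xbar, v2, v3)
```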


TABLE III

DISTRIBUTION OF SURFACE DEFECTS IN 2000 PIECES OF ENAMELED WARE

    x        n_x     n_x/n     λ*       λ**
    0        830     0.4150    0.879    0.769
    1        638     0.3190    0.556    1.025
    2        327     0.1635    0.894    1.257
    3        137     0.0685    1.06     1.431
    4         49     0.0245    1.17     1.531
    5         15     0.0075    1.26     1.200
    6          3     0.0015    1.24     2.333
    7          1     0.0005    1.39
    TOTAL   2000
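
The two columns of estimates in the table can be recomputed from the frequencies. In the Python sketch below, the last column is reproduced by the ratio $(x+1)\,n_{x+1}/n_x$. The preceding column is reproduced by solving $e^{-\lambda}\lambda^x/x! = n_x/n$ for $\lambda$, taking the smaller root when $x \geq 1$. Both formulas are inferred from the tabulated values and are assumptions, as are the function names and the use of bisection in place of printed Poisson tables.

```python
import math

def ratio_estimates(counts):
    """(x + 1) * n_{x+1} / n_x for each x that has a successor class."""
    return [(x + 1) * counts[x + 1] / counts[x] for x in range(len(counts) - 1)]

def class_estimate(x, rel_freq, iterations=200):
    """Smallest lam with exp(-lam) * lam**x / x! equal to rel_freq."""
    if x == 0:
        return -math.log(rel_freq)
    target = math.log(rel_freq) + math.lgamma(x + 1)

    def h(lam):
        # log of lam**x * exp(-lam), minus its target; increasing on (0, x].
        return x * math.log(lam) - lam - target

    lo, hi = 1e-12, float(x)
    if h(hi) < 0:
        raise ValueError("relative frequency too large for any Poisson parameter")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if h(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Frequencies n_0, ..., n_7 of Rider's example (n = 2000).
counts = [830, 638, 327, 137, 49, 15, 3, 1]
n = sum(counts)
ratios = ratio_estimates(counts)
from_classes = [class_estimate(x, c / n) for x, c in enumerate(counts)]
print([round(r, 3) for r in ratios])
print([round(c, 3) for c in from_classes])
```

For a sample from a single Poisson distribution both sets of values would scatter around one common parameter. The steady upward drift seen here is what suggests a mixture.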
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The lack of consistency among the various estimates $\lambda^*$ and
$\lambda^{**}$ clearly suggests that these data cannot be satisfactorily fitted
by the ordinary Poisson distribution. The mixed-Poisson distribution
provides an almost perfect fit for these data, and estimates computed
by substituting applicable sample values into equations (2.7.1) and
(2.5.5) are

$\lambda^* = 1.4767$

$\mu^* = 0.4777$

and

$p^* = 0.5224$.
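
These figures can be checked with the usual three-moment solution for a mixture of two Poissons. With $f_k = p\lambda^k + (1-p)\mu^k$, the parameters $\lambda$ and $\mu$ are the roots of $\theta^2 - S\theta + P = 0$, where $S = (f_3 - f_1 f_2)/(f_2 - f_1^2)$ and $P = (f_1 f_3 - f_2^2)/(f_2 - f_1^2)$. The Python sketch below assumes this standard form is equivalent to the equations cited above; the function name and the symbols $S$ and $P$ are illustrative.

```python
import math

def two_poisson_moment_estimates(f1, f2, f3):
    """Moment estimates (lam, mu, p) from the first three factorial moments."""
    denom = f2 - f1 * f1
    s = (f3 - f1 * f2) / denom        # estimate of lam + mu
    prod = (f1 * f3 - f2 * f2) / denom  # estimate of lam * mu
    disc = s * s - 4.0 * prod
    if disc < 0:
        raise ValueError("moments are not consistent with a two-Poisson mixture")
    root = math.sqrt(disc)
    lam = 0.5 * (s + root)
    mu = 0.5 * (s - root)
    p = (f1 - mu) / (lam - mu)
    return lam, mu, p

# Sample factorial moments of Rider's example.
lam_hat, mu_hat, p_hat = two_poisson_moment_estimates(0.9995, 1.248, 1.734)
print(round(lam_hat, 4), round(mu_hat, 4), round(p_hat, 4))
```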
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